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Section  1.  Introduction 


The  problem  of  sequentially  sampling  two  populations  with  unknown  means  so 
that  the  sum  of  observations  is  maximized  has  been  formulated  by  Robbins 
(1952).  For  the  binary  case  (success/failure  trials),  pi ay- the -winner 
strategies  have  been  shown  to  produce  better  results  than  a  random  selection  of 
populations,  in  the  sense  that  the  fraction  of  inferior  selections  approaches 
the  constant  qA/(qA+‘3B)  where  qs^  <  qg  are  the  failure  probabilities  (Robbins 
(1952),  Zelen  (1969),  Wei  and  Durham  (1978)).  A  randomized  play-the-winner 
plan  has  been  used  for  the  assignment  of  patients  to  treatments  in  a  controlled 
clinical  study  of  a  potentially  life-saving  medical  procedure  because  of  its 
tendency  to  put  more  patients  on  the  better  treatment  (Bartlett  et  al  (1985), 
Cornell  et  al  (1986)). 

The  purpose  of  this  paper  is  to  present  a  sampling  procedure  in  which  the 
fraction  of  inferior  selections  approaches  zero,  in  general,  v/henever  the  ob¬ 
served  response  is  a  positive  integer-valued  random  variable.  The  main  idea  is 
to  generate  new  samples  on  the  two  populations  according  to  the  cumulative 
response  observed  on  each,  as  is  done  with  the  play-the-winner  rules,  but  modi¬ 
fied  so  that  the  sample  sizes  for  the  two  populations  are  independent.  The 
successive  samples  then  correspond  to  the  generations  of  two  independent 
Galton-V.'atson  branching  processes  and  the  attendant  limit  theory  applies. 

Based  on  the  observed  successes  on  the  two  populations  with  a  binary  response, 
a  conditional  test  of  hypothesis  is  given  along  with  explicit  bounds  on  the 
povjer  function.  While  other  methods  for  dealing  with  the  binary  trials  exist 
in  which  the  fraction  of  inferior  selections  go  to  zero  (Bather  (1981)),  they 
do  not  seem  to  have  as  tractable  an  inferential  structure. 


Section  2.  Regenerative  Sampling  with  a  Positive  Response 


A  sequence  of  stopping  times  called  generation  points  are  defined  for  the 
observations  on  each  population  independently  of  the  observations  on  the  other 
population.  For  the  population  i  =  A,B,  let  be  independent 

and  identically  distributed  (i.i.d.)  random  variables  taking  positive  integer 
values  having  a  common  mean,  m^.  The  R^  correspond  to  the  observed 
responses  on  population  i.  Beginning  with  an  initial  sample  of  size  u^,  a 
positive  integer,  the  sequence  T^  of  generation  points  are  defined  by 


(i: 


T  1  =  U.  +  R,  +. 
n+1  1  1 


.+  R 


for  n  >  1. 


Mote  that  the  generation  points  are  defined  separately  for  each  population 
and  the  detailed  specification  of  the  order  of  selection  is  left  open.  It  will 
be  seen  that  the  observations  between  gc  .eration  points 


^0 


u. 

1 


(2) 


{  1  -  T^  ,  for  n  >  1, 

n  n+1  n  - 

form  the  generations  of  independent  Galton-Watson  branching  processes  for 
i  =  A,B,  regardless  of  how  the  samp''es  are  ordered. 

The  sampling  scheme  with  random  sampling  order  within  each  generation 
may  be  visualized  as  an  urn  model: 

Two  urns  are  given;  Urn  I,  a  sampling  Uii.,  and  Urn  II,  a  holding  urn. 
Initially,  u,  balls  of  type  A  and  u^  balls  of  type  B  are  placed  in  the 


sampling  urn.  To  begin  the  first  generation  of  sampling,  a  ball  is  dravTi  at 
random  from  the  sampling  urn  and  its  tyfjo  noted.  An  observation  is  then  made 
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on  the  population  indicated  and  the  response,  R  >  1  is  recorded.  A  total  of 
R  balls  of  that  type  are  then  placed  in  the  holding  urn.  The  process  is  re¬ 
peated  until  the  sampling  urn  is  empty.  That  is  the  end  of  the  first  genera¬ 
tion  of  sampling.  To  begin  the  second  generation,  all  the  balls  in  the  holding 
urn  are  placed  in  the  sampling  urn  and  sampling  begins  anew.  The  analysis  to 
follow  is  based  on  the  ball  populations  at  those  points  where  the  sampling  urn 
becomes  empty.  They  are  the  generation  points,  T^. 


Theorem  1;  Assume  m^  >  m^.  Then  as  n  tends  to  infinity,  the  fractions  of 

inferior  selections 

T^ 
n 

n  n 

n 

.,B 

Z  +  Z 
n  n 

approach  zero  with  probability  one. 


Proof.  Temporarily  suppressing  the  population  superscript  i  for  A,B  let 

Z^  =  be  the  nth  stage  sample  size.  Then  Z^  may  be  expressed  as 

X.  +...+  X_  where  X,  s  ,  are  iid  with  and  independent  of 

x,n  Z.,n  k,nTi+k  1 

n-1  n-1 

Z^_^,  n  >  1.  Thus  represents  the  nth  generation  of  a  Galton-Watson 

branching  process  initiated  by  Z^  =  u  ancestors  and  having  offspring  distri¬ 
bution  equal  to  that  of  R^  (Harris  (1963)).  The  generation  points 
T^^^  =  Zq  +...+  Z^  are  the  cumulative  progeny  up  to  the  nth  generation.  It 
follows  that  the  expected  generation  size  is 

urn  if  m  <  “ 

(4)  EZ  =  .  ,  m  =  ER, 

n  '1 

V  ”  if  m  =  “ 

and  the  average  sample  number  (ASN)  is 


4 


(5) 


ASN  =  ET  = 


nu 


m  -1 
m-1 


if  m  =  1 


if  m  <  “  and  m  >  1 


if  m 


Tv  prove  (3),  we  first  note  that  by  assumption  >  1,  we  can  choose  an 


integ.'T  M  so  big  that 

i  6) 


s  ER^  I  .  > 

n- 


★  ^  jlr  * 

Let  K  =  R  I  .  and  define  T  ,Z  in  a  similar  manner  with  =  u, . 
n  n  n  n  *  A 

n- 

Ihen  for  all  n  >  1 
(7) 


>  T*. 
n  -  n 


z  z 

Next,  it  is  well-known  that  ,  n>l}  and  {— ,  n>l}  are  martingales. 

Z  Z 

Since  ^  ~  '^A  ^  ^  ^  ~n  ~  martingale  convergence 

% 

z  z  . 

n  n  * 

theorem,  —  and  —  converge  with  probability  one  to  random  variables  W 

B  * 

and  W  respectively  as  n  tends  to  infinity.  Furthermore,  since  R  <  H, 

W*  is  a  strictly  positive  random  variable  with  probability  one.  In  view  of 


(8) 


^  -/■"i 

„4l  m.  .  n-] 


ik  n  * 

/m^  converges  to  V<  /(m^-1)  with  probability  one  as  n  tends  to  infinity. 
Similarly  if  m^  >  1 

if  "'b  ^  ^  Next,  choose  X  such  that  X  E(mg,m^);  then  by  (7) 


/m'  converges  to 


W®/(mg-l)  with  probability  one  and 


which  goes  to  zero  with  probability  one  as  n  tends  to  infinity.  This  also 
implies  that  =  o(T^  ,)  =  o(T^  +  Z^)  with  probability  one  since 


+  T®  1 
n+1  n+1 


n+1  n 


As  a  result,  it  follows  that 


Z^  +  Z®  +  Z® 

n  n  n+1  n  n 

converges  to  zero  with  probability  one  as  n  tends  to  infinity. 

The  following  corollaries  show  that  favorable  comparisons  need  not  be  res¬ 
tricted  to  the  same  generation  points  on  the  two  populations. 

Corollary  1;  If  m^  ^  "’b  positive  integers  d^  and  d^,  then 


+1’®, 

ndA  ndg 


T®.  -  T?_ 


ndg  (n-l)dg 


+  T  ^  —  T  ^ 


'a  ' 


n-1 ) d^ 


(12) 
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converge  to  zero  with  probability  one  as  n  tends  to  infinity.  In  particular 
if  nig  <  then  for  any  positive  integer  d, 


+  T® , 
n  nd 

(n+l)d  nd 
^n  ^(n+l)d  nd 

converge  to  zero  with  probability  one  as  n  tends  to  infinity. 

Corollary  2:  If  ^  some  positive  integers  d^  and  dg,  then  as 

n  tends  to  infinity 

B  A 

(14) 

,  <Cl)d^  -  ^nd^>/'Cl)d^  -  <d^>  ■ 

with  probability  one. 

Corollary  3:  If  >  n’g'  P  >  as  m 

f  I® 

n  n 


V  Z  +  Z 
n  n 

A  R 

If  ^  positive  integers  d^  and  dg,  then  for  any  p  >  0, 


as  n  > 


7 


to  that  of 


8 


S eclion  3  .  Es t iir.ation  in  a  Monotoni c  Branching  Proces s 


In  this  section  W3  shall  study  the  estimation  of  some  of  the  parameters  of 
the  separate  populations,  specifically  the  mean  responses  ir.^  and  the 
variances  p?  ,  i=--A,B.  As  each  separate  population  follov;s  a  Galton-Watson 
branching  process,  we  shall  suppress  the  superscript  and  subscript  i.  Let 

be  iid  positive  integer-valued  random  variables  v/ith  ER  =  m.  Let 
u  be  a  positive  integer  and 

(18)  Tq  =  0,  2.Q  =  u  and  =  u. 

For  n  >  1,  define 

Z  =  R„  , ,  +.  .  .+  Fw. 
n  A  -+1  T 

n-1  n 

(19)  =  u  +  +...+  R,^, 

n 

=  Z-  +. . .+  z  . 

0  n 

Notice  that  Zq  <  Z^  -  ^2  reason  we  shall  call  this 

Galton-Watson  branching  process  a  monotonic  branching  process.  For  each 
n  >  0,  let  be  the  p-field  generated  by  {Zq,..,,Z^}.  For  the  mean  m, 

W-;  shall  consider  the  following  two  estimators 

rr^n  =  — —  , 

(20)  " 

-  !nil 
^'n  Z 

n 

Ac-c  tv.'j  estiir.'.'tcrs  are  well-known  in  the  literature.  Leo  Dion  and  Keiding 


(L  /  ii  trie  references  therein.  Tlie  estir.utor  rn  is  a  maximLim  likelihood 

n 

e^i  iretcr  of  in.  The  folloveing  fact  concerning  the  strong  consistency  of  m^ 
c:.-;  :r.  is  v;:-!  1-knov.M. 


9 


'.W/ 

I  A'". 


Fact  1.  Assume  m  <  Then  with  probability  one,  as  n  », 


:2i: 


( i )  m  ->  m, 
n 


( ii )  m  ->  m. 
n 


hh 


mn 


•  * 


tv';::- 


In  the  following,  we  shall  study  the  Lp-consistency  of  and 

Definition.  6  is  an  L  -consistent  estimator  of  0  for  some  p  >  1  if  as 
- n  — p -  - 

n  ^ 


[22) 


El(e^-e) ^  0. 


To  establish  the  Lp-consistency,  we  first  develop  a  few  results  which  are 
interesting  in  their  own  right. 

Theorem  2:  Assume  that  ER^  <  ®  for  some  p  >  1.  Let  a  =  max(v»~)  •  Then 

2  P 

(23)  {IzJ;  -  m)  p,  n  >  1}  is  uniformly  integrable. 

n  ^ 

n 

Proof.  We  decompose,  for  some  K, 

(24)  R^-m  -  "  ™n’|R^<K)*  +  *®n'|R^>Kl  “  ^”n'lR^>Kl* 

'  \ 

Since  ER^  <  “,  for  all  e  >  0,  we  can  choose  K  so  that 


(25) 


E|Y^r  <  c- 


First,  let  s  >  max(2,p).  Then  by  the  Narcin)<iewicz-Zygmund  inequality  (see, 
e.g.  Chow  and  Teicher  (1978),  p.  356),  for  some  constant  B^, 


Z  n  n+1 

n 


2^/2  1 

Z  n  n+1 

n 


Therefore 


<  B^K^E  Z^^^  <  B^K  ^<  “>  . 

-  s  n  -  s 


X,^  +...+X,.  p 

n  ■^n+1 


,  n  >  li  is  uniformly  integrable. 


Next,  consider  the  Y's.  By  the  same  Marcinkiewicz-Zygmund  inequality,  we  have 

+l+--*+Y,p  P 

„  n  n+1 


^  Z  ^  n  n+1 

n 


B^E  —  E  Y„  . . -  1-^  II- 

p  „ap  '  T  +1 '  '  T  1  '  '  n 

^  Z  ^  n  n+1 


+  •••+  K  if  1  <  P  <  2, 


zP/2-1 

Bi’  E(-i^ -  E(  |y^  t 

p  „ap  '  T  +1 
Z  ^  n 

n 


E(  |y^  +...+  1y^  |P|f  )),  if  p  >  2, 


<  bP  G  E  =  G  bP, 

-  P  Z.  P 


which  can  be  made  arbitrarily  small.  Therefore 


(29) 


n  >  i 


s  uniformly  integrable. 


Combining  (27)  and  (29),  we  have  the  desired  result  (23) 


Corollary  4:  If  ER^  <  “  for  some  jr  >  1,  then 


(i)  {  “  ni|^,  n  >  1}  is  uniformly  integrable. 


(30) 


(ii)  {  n  >  1}  is  uniformly  integrable. 


Proof,  (i)  Since  |m^  -  mp  <  jz^  ^(  -  m)  p,  (Z  >1  and  a  <  1) , 


the  result  follows  from  Theorem  2. 


(ii)  By  (i),  {|m^|P,  n  >  1}  is  uniformly  integrable.  We  note  that 


T  1  T  +Z  Z  Z 

/  -5  1  \  n+1  n  n  n  n  .  -  n 

(31)  — -  =  — ^  =  1  +  —  <  1  +  2 - 

n  n  n  n-1 


Therefore  {(-^^)^,  n  >  1}  is  uniformly  integrable  and  it  follows  that 
■^n 


Tn+i-u  p 

(32)  {| — = -  m|  ,  n  >  1}  is  uniformly  integrable. 


n 


Corollary  5;  Assume  ER^  <  ®  for  some  p  >  1.  Then  as  n  ->  <» 


(33) 


(i)  E|m  -  mp  0,  Lp-consistency, 


(ii)  Elm  -ml*^-»0,  L^-consistency . 


Proof.  The  result  follows  from  Fact  1  and  Corollary  4. 


Corollary  6:  If  ER^  <  °°  for  some  p  >  2  and  a  =  Var  R  e  (0,™), 
then  as  n  ->  <», 


-X  /2 


(34)  Elz^(^-m)|P.  oP  rjxjP^dx, 


and  for  any  positive  odd  integer  j  <  p, 


1  Z  ^ 

(35)  E(Z^(-|±i  -  m))^  0. 


Proof.  Since  as  n  -»  “ 

(36)  Z'J(-^  -  m)  5  N(0,a^) 

n  o 

n 

(See  e.g.  Nagaev  (1967)  or  Dion  (1974),  the  results  follow  from  Theorem  2. 

Next  we  shall  establish  a  result  analogous  to  Theorem  2  for  T^.  We  need 
the  following  elementary  lemma. 


Lemma  1:  Let  V,V^,...  be  iid  positive  random  variables  with  P[V  >  1]  =  1 
and  EV  =  m  >  1.  Then  for  all  q  >  1,  there  is  an  a  with  a  e  (0,1)  such 
that 

n>l  1  n 


Proof.  By  the  strong  law  of  large  numbers,  as  n  ->  <» 

n  .  1 


(38) 


^  —  <  1 
V,+. , .+V  m 
1  n 


with  probability  one.  Since  (n/(Vj^+. .  .+V^)  )^  <  1,  as  n 

n 


(39) 


E( 


- )^  i- 

V---^^n  m'^ 


by  the  bounded  convergence  theorem.  Therefore  there  is  an  a  <1  and  an 
integer  n^  such  that  for  all  n  >  n^ 


:40)  E( 


n 


V, + . . . +V 
1  n 


1^3  <  a*. 


Since  E(n/V^+. . .+V^) )^  <  1  for  n  =  l,...,nQ,  take 


(41)  a  =  max(cx  ,E(^)’^, . . .  ,E(^  ^ 


)^) 

0 


and  the  result  follows. 


13 


Theorem  3:  Assume  that  ER^  <  ®  for  some  p  >  1.  Let  a  =  max(j, 


for  any  r  <  p 
(42)  . 


T  T -u-mT 
n+1  n 


■'n-l 


,  n  >  1 


is  uniformly  integrable. 


Proof.  Let  r  be  less  than  p.  Choose  s  such  that  s  >  1  and  r 
Then  by  Minkowski  and  Holder  inequalities 

1 1/s 

(43) 


/ 

T  . -u-mT 

s' 

IT 

n+1  n 

Ej 

V 

Z  1 
n-1 

/ 

n 

<  ^ 
k=l 


n  ( 
<  ^ 
k=l 


/ 

t? 

"k-ll 

as 

h  "^k-1 

s' 

\ 

/ 

k-1 


1/s 


^k-1 


■■n-l 


as£.^ 


E>-s 


sp 


Zk  mZk_^ 


"k-1 


1/P 


Let  q  =  asp/(p-s).  Then  by  Lemma  1,  for  some  0  <  a  <  1, 

n-1  n-1  n-2 


(44) 


■'n-2 


If 

z  , 

'  n-2 

n-1 

'k-ljq 

< 

n-k 

a 

n-2 


Z,  -mZ 


k  k-1  □  D 

By  Theorem  2,  sup(E| - p  )^  <  C  for  some  finite  constant  C. 


k>l 


"k-1 


— ) .  Then 
P 


<  s  <  p. 


Therefore 


from  (43)  and  (44),  we  have 


14 


tr 

T  1 -U-mT 
n+1  n 

s' 

Lj 

n-1 

> 

n  ,  —  1  sp  ^ 

<  C  I  (a^  =  C  — -  <  — - 

-  W_1  £zl-  2zl 


1-a  ^  1-a 


and  the  result  follows. 

i^z-vr-/^!  T^r-tr  1  •  T  -f  ITdP  /oc 


Corollary  7:  If  ER^<®  for  some  p>l  and  a  =  max(-;^,— ),  then  for  any  r<p, 

^  p 


T  . 


1  1 1  — 

(46)  {  |t^  - in)|'^,  n  >  1}  is  uniformly  integrable. 


Proof.  In  view  of 
(47)  |tJ~ 


1  T  .  -u 

- m) 

n 


T  1 -u-mT 
I  n+1  n I 


the  result  follows  from  Theorem  3. 


Corollary  8;  If  ER^  <  ®  for  some  p  >  2  and  Var  Vi  =  c  e(0,®),  then  for 
any  r  <  p,  as  n  -»  « 

2 

,  T  1-u  I  ir  -X  /2 

4^  T-t^l  .  r*  r  CD  Ivl  o  ' 


1  T  1-u  I  r  -X  /2 

,48)  Ill’ll  °  dx 

n  s2ii 


and  for  any  positive  odd  integer  j  <  pf 

T  "“U 

(49)  E(T*^(^^i^ - m))^  ^  0. 


Proof.  Since  as  n  ->  «>, 

,  T  T -u  D  _ 

n+1  ,  2, 

Tj^( — - m)  +  N(0,ct  ), 

n 

(see  e.g.  Dion  (1974)  or  Jagers  (197^));  the  result  follows  from  Corollary  7. 
Remar  It  4.  We  conjecture  that  the  r  in  Theorem  3  and  Corollaries  7  and  8 
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can  be  improved  to  p,  in  which  case  p  >  1  can  replace  p  >  1  in  Theorem  3 

and  Corollary  7;  and  j  <  p  can  be  changed  to  j  <  p  in  Corollary  8. 

2 

In  the  remainder  of  this  section  we  shall  assume  that  Var  R  =  cr  which 

2 

is  finite  and  positive.  We  shall  study  the  following  estimators  of  a  : 

T  T  n  Z,  Z  - 

^  ^  -2  1  ^  „  ,  k  n  ^2 

0=7:^  - » 


-2 

1 

n 

y 

\-i< 

Z 

n 

0  — 
n 

n 

L 

k=l 

^k-1 

Z  1 
n-1 

~2 

1 

n 

y 

Vi< 

m)2 

c  — 
n 

n 

L 

k=l 

rj  /  • 

^k-1 

The  consistency  of  these  estimators  is  given  in  the  literature  and  we  collect 
them  into  the  following  fact. 

Fact  2. 

-2  2 

(i)  Heyde  (1974).  As  n  a  with  probability  one. 

(51) 

"2  2 

(ii)  Dion  (1975).  As  n  ->  ct  in  probability. 

In  the  following,  we  shall  study  the  Lp-consistency  of  these  two  estimators 

^  2 
of  0  . 

Theorem  4:  Assume  that  “  for  some  p  .*  1  and  let  a  =  max(i-). 

-  r  -  2  p 


(52)  {  |n  ( cr^  -  <7  )  r  /  n  >  1}  is  uniformly  integrable. 


(52)  {|n^  ^(7  -  0^)|P 
Proof.  For  each  k  >  1, 

(53)  E(Z,,  ,(-2i- 


k-1 


k-1 


since  ER^^  <  Theorem  2  implies  the  uniform  integrability  of 
Z,. 


^k-l'z, 


-  m)  )*'  ,  k  >  1}.  By  Lemmas  1  and  2  in  Chow  and  Yu  (1984), 
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.«  li  ^  I,  0  0 

(54)  { |—r  ^  (Z,  ^ (r -  m)  -  a  )|^,  n  >  1}  is  uniformly  integrable, 

n®  k=l  ^k-1 

which  is  the  desired  result. 

Theorem  5:  Assume  that  <  «>  for  some  p  >  1,  and  let  a  =  max(j,^). 


{  jn^  ~  ^  ^  uniformly  integrable. 


Proof .  We  decompose 

/cc\  l-a,-2  2. 

( 56 )  n  (  -  CT  ) 

1  n  Z,  0  T  Z  -  n 

"  “a  ^^k-l^Z.’T  ~  -  a  )  +  (m  -  ^— -)  Z  Z 

n  k=l  k-1  n-1  k=l 

n  Z  -mZ  ^ 

-2  (Z  -  Z  m)( 

k=l  ^  ^  n-1 

In  viev;  of  Theorem  4,  it  suffices  to  show  the  uniform  integrability  of 

(i)  ^  ^k-ll^' 

n  n-1  k=l 


Z  -mZ 


4  i  «->  IlUJ  « 

(ii)  {|  —  J:  (Zj^  -  mZj^_^)(-^^)  |P,  n  >  1}. 

n  k=l  n-1 


For  (i),  by  Theorem  2  (for  some  finite  constant  C)  and  by  Lemma  1  (for  some 

a  z  (0,1))  and  (44)  we  have 

1  mZ  .  -Z  _  n  Z,  . 

(58)  E(^((  ^  ^)^)P(  I  ^^)P) 

n^P  Z'^  .  k=l  ^n-1 

n-1 

1  n  Z,  .  mZ  ,-Z  _ 

E(  |-Jl=i-n|2PlF  ,) 

n^P  k-l  Vl  z"*  '  ' 


<  -j-  E(E  ^^)P  <  -^  (E  (E(^)P)^/P)P 
n®P  1  ^n-1  n^P  1  ^n-1 


<  (I  „(n-k)/p,p  ^  _C - 1^  ^  0,  as  n  ^  - 

■  n®P  1  ■  n^P  (1-«1/P)P 


yielding  (i).  For  (ii),  by  the  seime  results  used  for  (i),  we  have 

-  n  (Z, -mZ,  .)  (z  -mZ  , )  [z^  T 
(59)  ^Ell  ^  Jizl  IP 


k=l 


<  Ji_  (  £  (eI^^  ”^k-l|P|\  "^n-l|p  ^k-1  p/2  1/p  p 

k=l  Iz  afz  T  ^n-1 

k-1  n-1 


1  ri  Z .  inZ  1  1  ^  1  ^  1  2 1  1  1  /■^ 

<  _!_(  £  (e|  '^~^|^P)^^^P(e|  ^  k-l^p^l/2p^p 


1 


■^1 


c‘*(£(E(J^)P)^/^P)P  < -^ - (-5 - aO  as  n  ->  » 

n  1  n-1  n  (1-a 


yielding  (ii).  And  this  completes  the  proof. 

Corollary  9;  If  <  ®  for  some  p  >  1,  then 

^2  2d 

(i)  { I -  a  1^,  n  >  1}  is  uniformly  integrable 

(60) 

”"2  2  D 

(ii)  { I CT^  -  CT  1^,  n  >  1)  is  uniformly  integrable. 

2  ~2  -2 

Proof.  Lee  a  be  either  c  or  a  .  Since  a  <  1, 

-  n  n  n  -  ' 

n^  ^  I  -  CT^  I  >  I  ~  \  srid  the  result  follows  from  Theorems  4  and  5 

Corollary  10:  If  ER^^  <  “  for  some  p  >  1,  then  as  n  ® 

(i)  E I  0,  Lp-consistency, 


ii)  e1<^^  ~  -*  0, 


Lp-consistency. 


"2  -2  2 

Proof .  By  Fact  2,  and  converge  to  a  in  probability  as  n 

Together  with  this,  Corollary  9  gives  the  L  -consistency. 


Corollary  11;  If  ER  ^  “  for  some  1  <  p  <  2,  then  as  n  “ 


(62) 


U:  If 

ER  ^  < 

Ed  ^ 

(i)  E 

|n  P  (a; 

(ii)  E 

In  P  (a 

n 


Proof.  By  Lemma  1  of  Chow  and  Yu  (1984)  and  (53),  as  n  “ 


EzI 

P  -  a2)|P  0. 


(63)  E|n  ^  (ct^  -  a 


By  (56),  (58),  (59)  and  (63), 


Eli 

P  _  2,  |p 


E I  n  '  ~  ^ 


0. 


Corollary  12;  If  ER^P  <  ®  for  some  p  >  2,  then  as  n  “ 

dx; 


E  _.2/. 

/c/ii  trl  ^^~2  2,|P  ^  -.2  2p  jjd  e  ^  ^ 

(64)  En"(a^-a)E-^2  ' 


ilii 


and  for  any  positive  odd  integer  j  <  P/ 
(65) 

Consequently  as  n  ® 


E(n^(CT^  -  ^  0. 

n 


(66) 


-2  2  1 
E(a^)  =  +  o(— ), 


i.e.  the  bias  is  of  smaller  order  than  n  ,  and 


(67) 


„  ,-2,  2a  ,1, 

Var  a  )  ^  -  +  o(—) . 

n  n  'n 


Proof.  Heyde  (1974)  has  shown  that  as  n  ^ 

D 


(68) 


n'(apj  -  a  )  -»  N(0,2a 


By  Theorem  5,  the  results  follow. 


4.  Iinat i '/l  Play-rii  -V.innf^T 

/.s  n:!*n;nl  exl (.'nsicii  of  the?  Z'jlcn  (1969)  approacli  to  binary  comparisoiis 
1;  L  r.j  e  ,:prl  I  '  th?:  nur'  -'i.  of  trials  on  treatment  i,  i  =■  A  or  B,  until  the 
fiist  faiinrc'  is  chserved.  So  the  initial  generation  does  not  correspond  to 
a;',/  E't  ri:  trials  but  only  tci  the  nurrher  of  success  runs  to  be  observed  in  the 
flirt  g-.i' rial  ion  of  sampling  on  treabviont  i.  The  responses,  R^,  have  a 
c:-' rr.' 1 1  i  r  distiibution  virth  finite  mean  m.  =  1/q.  and  finite  variance 

j  1 

•*? 

c'  --  (l-qj^)Aj^  '-.'Iicre  q^,  is  the  probability  of  failure.  The  ASN  is 
in  ( 1 -c/f  )/q^*(  l--q^  )  over  the  first  "  ^'i  corresponding  to  genera¬ 

tions  1  through  n.  Note  that  at  least  nu^  trials  are  run  on  treatment  i, 
to;  thue  is  no  abrolute  upj.er  bourjd  on  the  actual  nun’^er  of  trials  for  any 
n  >  1. 

'i  nr'  C'.  t  inv'.tor  of  p^  =  1  -  q^,  =■•  1  -  l/m^ ,  reduces  to 

-  u.)/(T^  1  -  u.  )  vhich  equals  the  cumulative  nuirJoer  of  successes 

d.' .'id  .d  ).y  the  nun'.iir  of  treatnsonts  on  tre.ntment  i  in  this  scheme.  The 

fiaclicn  c£  inferior  treatment  scloctions  is  small  if  n  is  large  almost 

surely  and  in  the  sense.  ;^in  approximate  distribution  is  availcible. 

.Since  Vh(^  lim  zVnn*)  has  a  gamma  distribution  with  moment  generating 
n 

sN .  -u . 

function  E;(e  ^  ( 1  +  s/u^)  s  >  0,  (Harris  (1963)),  2u>7^  has  a  chi- 

iqU'O.rcd  distribution  v.'ith  2in  degrees  of  freedom.  Thus  the  ratio, 

P  p. 

h  (rd'i  Vc  fn)T  ,  h.iis  an  approx iii.a to  F  distribution  viit.  2u.  and  2u,^ 

('.'no-;:.,  cf  freedom,  where  the  constants  are  C^(n)  -  )/(l  “  q'^^'  ^)» 

i  A,Fr  .  F'u' therm arr',  by  Corollary  2,  with  probability 

as  n  ->  O'. 

lii'.'etu.  6  shows  that  L  convergence  obtains  as  well. 


Theorem  6:  Assume  that  1/q^  ""  ^  If  PlR^  =  x]  =  (l-q^^) 

x  =  l,2,...,  i=A,B,  then  for  any  p  >  0,  as  n  ->  “> 


(i)  E(-^)^  ->  0, 

^n 


(ii)  E(-^)P  ^  0. 

^n 

Proof.  Without  loss  of  generality,  assume  that  u^  =  u^  =  1  and  that  p  is 
an  integer.  Then 

E(Z^)P  =  r  xP(l-q")"-lq" 

x=l 

CD 

<  E  x(x+l) . . .(x+p-l)(l-qp)^~^qp  =  p!  q'^P 

X=1  B  B  B 


_,1  ,p  _  1  n,x-l  n 

\  ‘1-‘3a>  % 

Z  x=-l  x*^ 

n 

Pi  ,,  n,x-l  n  . 


1  ,,  n,x-l  n 


—  (1-q.)  qn  +  i:  “  (i-qn)  q^v 
x=l  xP  ^  ^  x=p+l  xP  ^  ^ 


.  n  ,  _  1  n,x-l  n 

-  P'^A  ^  x(x-l ) . . .  (x-p+1 )  Pa  Pa 
p+i 


n  ^'^pr  Pa 


,  for  some  constant  C. 


Hence  as  n  -■>  ® 


i)  E(-^)P  =  E(Z®)P  E(4-)"  '  - 


<  (  (E(Z®)P)-^/f\...  +  (E(Z®)P)^/P)P 


<  (1  +  (plq„P)^''^P+. .  .  +  (p!q  "P)^/P)P 

D  D 


y  1/1  ”  ^  \ 

<  p!  ( 1  +  +.  . .+  q.,  ) 

D  B 


%-!  (qe-l) 


ii)  E(^)P  <  E.B|P 

n+1 


Cnq^P  log  q 


o'  1  n 

^ -ifiTi  J^p  <P<^A  - 


->  0  as  n  ->  ®. 


”^A  ^ 

Corollary  13:  Assurr.e  that  q^  ~  '’V  ^  ~  some  positive 


integers  and  d^, 


A  A 


ii)  E(T^,  /i  ,  )P  ^  0  as  n  ->  “> 
nd„  nd. 

A  A 


Remark  5.  The  delicacy  of  the  above  result  is  noteworthy.  It  seems  plausible 


that  a  similar  convergence  obtains  in  non-geometric  cases  but  no  proof  is 


knov.Ti. 


Remark  6.  Certainly  other  selection  procedures  exist  for  the  present  case, 


Bather  (1901).  However  direct  comparisons  are  difficult  because  the  sample 


sixes  are  random  in  regenerative  sampling  but  are  fixed  in  Bather's  study. 
The  actual  implementation  of  the  trials  is  only  mildly  constrained  by 


by  the  requirement  that  the  generation  points  T^,  T®  be  reached  on 

both  treatments  for  some  fixed  value  of  n.  It  is  interesting  that  if  it  is 

assum.ed,  in  addition,  that  every  generation  point  up  to  the  nth 

be  reached  on  both  treatments  i  =  A,B  before  proceeding  to  the  next,  then 

the  present  scheme  can  be  visualized  as  an  adaptive  iteration  of  the  Zelen 

scheme.  In  particular,  let  (Z^,Z^)  =  FV7(u  ,u  )  denote  the  number  of  trials 

on  the  two  treatments  in  a  Zelen  type  experiment  which  is  stopped  when  u^ 

failures  are  observed  on  treatment  A  and  u  failures  are  observed  on 

B 

treatment  B.  Then  the  successive  sample  sizes  are  generated  recursively  by 
(Z‘^,Z®)  =  PVJ(u^,Ug),  =  ^(^k-l'^k-1^ '  ^  ^  2,...,n.  The  urn  scheme 

presented  in  Section  2  can  be  adapted  to  the  present  case  in  which  the  total 
response  is  spread  over  a  number  of  trials.  As  before,  select  a  ball  from 
the  sampling  urn  and  note  its  type.  Administer  the  indicated  treatment  and 
observe  the  response.  If  it  is  a  success  a  ball  of  the  corresponding  type  is 
added  to  the  holding  urn  and  the  selected  ball  is  returned  to  the  sampling 
urn.  If  a  failure  is  observed  then  the  selected  ball  is  simply  transferred 
to  the  holding  urn.  As  before,  a  generation  is  complete  when  the  sampling 
urn  becomes  empty. 

In  view  of  Fact  1  in  Section  3  as  applied  to  the  present  iterative 
play-the-winner  design,  it  is  plausible  to  base  a  test  of  hypothesis  about 


the  failure  rates  on  the  number  of  successes  observed  over  n^  sampling 

generations,  i  =  A,B.  In  particular,  f.-r  positive  integers  n  ,n_,  a  test  is 

A  B 

proposed  for 


‘O'  ^A 


"^A  ^B  "^A  “^B 

>  Qn  VS  H  :  <  q^ 


B  B  ..A  B 

where  S  =  -  u  .  The  random  variables  b  and  S  are  independent  and 

represent  the  number  of  successes  on  treatments  A  and  B  over  trials 

and  respectively,  if  n.  =  n_  =  1  and  u.,  =  u„  =  f 

the  test  is  equivalent  to  that  of  Zelen  (1969). 

Since  the  samples  on  the  two  treatments  are  independent  and  since  the 
geomietric  distribution  is  preserved  under  the  composition  of  probability 
generating  functions  in  the  Galton-Watson  branching  process  (Harris(1963) ) , 
S^,S^  and  S  have  negative  binomial  distributions  and  the  conditional  distri¬ 
bution  can  be  presented  explicitly.  For  r  a  non-negative  integer  and 
k  =  0;l;.../r^ 


(70)  g(k|r)  s  P[S  =  k|S^=  r] 


k+u  -1  r-k+u  -1  .  r 

=  (  ,  )(  1  )X^  /  E 

j=0 


j+u  -1  r-j+u  -1  . 


where  X=  ( 1-q^'^)  ( l-q^  ) . 

^A 

Under  q.  =  q_  ,  X  =  1  and  the  distribution  with  r  a  nonnegative 
A  B 

integer  and  x  =  0,1, ...,r,  is 


(71)  G(x|r)  H  p[S^  <  x\s  = 
The  a-level  test  is  proposed: 

(72)  Reject  in  favor  of 

In  the  simplest  case, 

Theorem  7 :  Assume  that  ^  ~ 


x 

r]  =  E  g(k|r) . 
k=0 


if  and  only  if  G(S^|S)  >  1-a. 

1,  the  power  can  be  bounded  as  follows 


n^  and  n^  are  positive  integers  and 


0  <  a  <  h.  Then 


(73)  (q  ^  +  (l-qg^)(l-q^  )®)  S  =  P[G(S^|S)  >  1-a] 

.  ,1  "a,  ,,  ”'^w,  ^A,e,-i 

<  (1-q  )(q  +  (1-q  )(l-q  )  )  , 

where  0  =  -  1  and  =  P[G{S^|S)  >  1-a], 

7  Remark  5^,.  The  lower  bound  is  exact  if  a  =  Jj,  and  for  a  small,  approximately 


^  ^  w,  >  D./n 

%  s  K„  <  /  li-q*  )• 


Proof.  Since  G(x[r)  =  (x+l)/(r+l),  x  =  0,1, ...,r. 

Thus  K  =  P[e(S^+l)  >  S®],  Since 
a 

(75)  P(Reject  H.|S®-  kj  -  P(S*+  1  >  ek) 

0k  0k  — 1 

and  the  latter  is  bounded  below  by  (1-q  )  and  above  by  (1-q  )  , 

is  seen  to  satisfy  the  bound  stated  upon  averaging  over  the  values  of  S  . 

In  view  of  the  interesting  outcome  of  the  clinical  study  by  Bartlett  et  al 
(1985),  Cornell  et  al  (1986),  performance  values  with  q  close  to  zero  are 
presented  in  the  Table.  Of  course,  the  power  is  conserved  upon  truncation  of 
the  favored  treatment  since  the  success  counts  are  cumulative.  So  the  null 
hypothesis  could  still  be  rejected  without  ever  completing  a  generation  on  the 
better  treatment.  If  it  appears  that  *3;^  “  ^  true,  as  in  the  afore¬ 

mentioned  study,  then  the  trial  may  be  concluded  at  any  point  after  the 
specified  generation  point  is  reached  on  treatment  B.  Of  course  at  least 
UjjUjj  trials  shall  be  run  on  treatment  B  with  regenerative  sampling. 
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